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Abstract 

Following up on a paper of Balamohan, Kuznetsov, and Tanny, we analyze a variant 
of Hofstadter's Q-sequence and show it is 2-automatic. An automaton computing the 
sequence is explicitly given. 

1 Introduction 

In his 1979 book Godel, Escher, Bach [7], Douglas Hofstadter introduced the sequence Q{n) 
defined by the recursion 

Q{n) = Q{n - Q{n - 1)) + Q{n - Q{n - 2)) 

for n > 2 and Q{1) = Q{2) = 1. Although it has been studied extensively (e.g., [TO]), 
still little is known about its behavior, and it is not mentioned in standard books about 
recurrences (e.g., [6]). It is sequence A005185 in Sloane's Encyclopedia jT2] . 

Twenty years later, Hofstadter and Huber introduced a family of sequences analogous to 
the Q-sequence, and defined by the recursion 

Qr,s{n) = Qr,s{n - Qr,s{n -r)) + Qr,s{n - Qr,s{n - s)) 

for n > s > r [8]. The case r = 1, s = 4 is of particular interest. 

Recently Balamohan, Kuznetsov and Tanny [3] gave a nearly complete analysis of the 
sequence Qi 4 (called V in their paper). It is defined by 

V{1) = V{2) = V{3) = 1/(4) = 1, and Vn > 4, V{n) := V{n-V{n-1)) + V{n-V{n-4)). 
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Here is a short table of the sequence V (sequence A063882 in Sloane's Encyclopedia [12]). 
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Among the results of Balamohan, Kuznetsov, and Tanny is a precise description of the 
"frequency" sequence F{n) defined by 



Here is a short table of the sequence F (sequence A132157 in Sloane's Encyclopedia [T2]). 
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In particular they proved the following theorem [U Lemmas 13-19 and Table 5]. 

Theorem 1 (Balamohan, Kuznetsov, Tanny). There exist two (explicit) maps g,h, with 
g, h : {1, 2, 3}"^ — )■ {1, 2, 3}, such that, for all a > 3 



(We note that in Lemma 13 of [4j, the quantifiers a > 3 for the equality F{2a) = 2 and 
a > 4 for the equality F{2a + 1) = 2 should have been mentioned.) 

In this paper we prove that the sequence (F(n))„>i is 2-automatic, which means essen- 
tially that F{n) can be computed "in a simple way" from the base-2 representation of n — 
in particular, it can be computed in O(logn) time. Furthermore, we give the automaton 
explicitly. For definitions and properties of automatic sequences, the reader is referred to 
[2] . For some recent related papers, see [9l |5l [11] . 



We begin this section with a general result on automatic sequences. Before stating the 
theorem we need a notation. 

Definition 2. Let W = {W{n))n>o be a sequence. Let a be an integer in Z. We let W"' 
denote the sequence defined, for n > —a, by 



F(a) := V{n) = a}. 



F(2a) 
F(2a + 1) 



r/(F(a-2),F(a-l),F(a),F(a + l)) 
h{F{a - 2), F{a - 1), F(a), F{a + 1)). 



2 The main result 



iy"(n) := W{n + a). 
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Definition 3. Let c be an integer > 0. If a sequence {W{n))n>o is only defined for n > c, 
we assume tliat the values of W{n) for n G [0, c) are arbitrary. 

Theorem 4. Let {U{n))n>o be a sequence with values in a finite set A. Let q > 2 be an 
integer. Suppose that there exist four nonnegative integers t, a, b, uq, and g*"^^ functions from 

the set A"''^ to A, denoted /o, fi, ■ ■ ■ , fqt+i-i, such that Vj G [0, q^^^ — 1] and Vn > Uq 

U{q^+^n + j) = 

fjiU--{n), U'\n), U%n), U\n), U\n), U^in), Usin), U ,j+^{n)) 

3-1 

where Ui = U, U2, ■ ■ ■ , U qt+i-i are the subsequences {U{q^n + j))„>o with i G [0,t] and j G 

9-1 

[0,g* — 1], written in some fixed order. Then the sequence {U{n))n>o is q-automatic. 

Before proving this theorem we recall the Euclidean division of an integer in Z by a 
positive integer. 

Lemma 5. Let S be an integer in Z and Q be a positive integer. Then there exist X G Z 
and an integer Y , < Y < Q , such that S = QX + Y . 

Proof Let X = [S/Q\ and F = 5 - Q [S/Q\ . Then < S/Q - [S/Q\ < 1, so, multiplying 
by Q, we get < F < Q. □ 

Proof of Theorem^ To prove that the sequence U = {U{n))n>o is g-automatic, it suffices 

to find a finite set of sequences S that contains U, such that if V = (y{n))n>o belongs to £, 

then, for any r G [0, g — 1] the sequence (y{qn + r))„>o also belongs to £. Fix two positive 

integers A and B such that A > max(no, ^^iy^) and B > ^^i^- Recall that Ui = U, 

U2, . . . , U qt+i^i are the sequences {U{q^n + j))n>o with i G [0,t] and j G [0,g* — 1]. Also 
9-1 

recall that the sequence Uj^ is defined by U^{n) := Uk{n + a). Let S be the (finite) set of 
sequences defined by 

V eS^3£e[l, i 3A; G [-A, S], Vn > A, V{ri) = U^{n). 

Now let be a sequence in S. Take r G [0, g — 1] . There exist £ G [1, ^-^z^] and k G [—A, B] 
such that for all n > A, we have 

V{qn + r) = U^{qn + r) = Ui{qn + r + k). 

Hence for some i <t and j G [0, g* — 1] 

V{qn + r) = U{q\qn + r + k) + j) . 

Write g*(r + k) + j = q^^^x + y, with a; G Z and y G [0, q^^^ — 1], so that 

V{qn + r) = U{q'+^{n + x) + y). 
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Note that 

g'+^x < q'~^^x + y = q\r + k) + j < q\r + k + l) 

and 

= q\r + k) + j - y > q\r + k) - 

Hence 



r + k — q r + + 1 
< X < 



We distinguish two cases. 



Case 1: i < t. Then i + 1 < t. Thus there exists £' G [1, ^-^zj^] such that, for n > A, 

V{qn + r) = U{q'^'^{n + x) + y) = Ue{n + x) = U^u,{n). 
Now X > '-^^ > > > -A (since A > > and x < < < B 

q — q — q — V — q-1 — g-l'" g — q — 

(since B > ^^^^p > This shows that the sequence {V{qn + r))„>o belongs to £. 

Case 2: i = t. Then i + 1 = t + 1. From the hypothesis and the condition A > no, we can 
write, for n > A, 

V{qn + r) = U{q*~^^{n + x) + y) = 

To prove that the sequence {y{qn + r))n>o belongs to £, it suffices to prove that all sequences 
for /3 G [x — a, X + h] and all sequences for £ G [1, "^^ "^ J belong to and to use 
composition of maps. But we have 

r + k — q —A — q 

p > X — a > a > a > —A 

q q 

(recall that A > and 

r + fc + 1 , q + B , 
P<x + b< +b< ^^^— + b<B 

q q 

(recall that B > ^^^i^). This implies that all sequences occurring in the arguments of fy 
above belong to □ 

Remark 6. Theorem H] above is similar to (but different from) j3], Theorem 6, p. 5] on k- 
regular sequences. That theorem implies Theorem H] above in the case where the maps fj 
are linear. 

Corollary 7. The sequence F = (F(n))„>o is 2-automatic. 

Proof. It suffices to use the theorem recalled in the first section, after having extended the 
sequence F by F{Q) = 0. □ 
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3 An explicit automaton 



In this section we provide an explicit automatoiu to calculate the sequence F. 

The automaton is constructed in two stages. First, we give an automaton A with the 
property that reading n in base 2 takes us to a state q with the property that the four 
values F{n + a) for —2 < a < 1 are completely determined by q. Next, we show that A 
can be minimized to give an automaton B computing F{n). We remark that we assume 
throughout that the automaton reads the ordinary base-2 representation of n from "left to 
right", ending at the least significant digit, although we do allow the possibility of leading 
zeros at the start. 

Let us start with the description of A = {Q, S, A, 6, go, t). The machine A has 33 states 
with strings as names; E = {0, 1}; A = {0, 1, 2, 3, 4}^, go = The transition function 6 and 
the output map r are given in Table [T] below. 

We introduce some notation. Let [w] denote the integer represented by the binary string 
w in base 2. Thus, for example, [00110] = [110] = 6. Note that [e] = 0, where e denotes the 
empty string. If F is our sequence defined above, then by F{a..a + i — we mean the string 
of length i given by the values of the function F at a, a + 1, . . . , a + i — 1. 

Our intent is that if w is a binary string, then T{S{qo,w)) is the string of length 4 given 
by F{n - 2..n + 1), where n = [w]. (Note that we define F(0) = F(-l) = F(-2) = 0.) 

To prove that this automaton computes F{n) correctly, it suffices to show that 

(a) for each state g we have r(g) = F([g] - 2)F([g] - l)F([g])F([g] + 1); and 

(b) a p = S{q,a) for two states p,q E Q and a G {0, 1}, then F{[px]) = F{[qax]) for all 
strings x. 

Part (a) can be verified by a computation, which we omit. For example, since [111001111] = 
463, the claim r(lllOOllll) = 2133 means F(461..464) = 2133, which can easily be checked. 

Part (b) requires a tedious simultaneous induction on all the assertions, by induction on 
\x\. Not surprisingly, we omit most of the details and just prove a single representative case. 

Consider the transition 5(100, 1) = 110. Here we must prove that 

F([1001x]) = F([110a;]) (1) 

for all strings x. We do so by induction on x. The base case is a; = e, and we have 
F([1001]) = F(9) = 2 and F([110]) = F(6) = 2. 

For the induction step, we use the fact that [H Table 5] shows that F{2a) and F{2a + 1) 
is completely determined by F{a — 2), F{a — 1), -F(ci), and F{a + 1). It thus suffices to check 
that F([1001x] + a) = F([110x] + a) for —2 < a < 1; doing so will then prove ([1]) for xO and 
xl, thus completing the induction. 

The only cases that require any computation are when [x\ = and a = — 1, —2, or [x] = 1 
and a = —2, or x is a number of the form 2^ — 1 for some J > 1 and a = 1. 

^In honor of Alf van der Poorten, we cannot resist quoting Voltaire: "Impuissantes machines/ Automates 
pensants mus par des mains divines." 
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Case 1: X = O-' for some j > 0. If j = then this is the assertion that F([1001] + a) — 
F([110] + a) for -2 < a < 1, which is the same as the claim that F(7..10) = F(4..7). But 
F(7..10) = 1221 = F(4..7). 



Table 1: The automaton A 
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Otherwise j > 1. Then [lOOlx] - 1 = [lOOlO-'] - 1 = [lOOOP] and [llOx] - 1 = [llOO-'] - 
1 = [1011-']. Now by induction we have F([1000P]) = F{[10001V~^]) = F{[1011V-^]) = 
F{[101V]), as desired. 

Similarly, [1001x]-2 = [10010^] -2 = [lOOP-^O]. Also [110a;] -2 = [1100-']-2 = [lOPO]. 
Then by induction we have F{[100V-^) = ^([lOOlP-^O]) = ^([lOlll-'-^O]) = F([101-'0]), 
as desired. 

Case 2: x = On for some j > 0. Then [lOOlx] - 2 = [lOOlOn] - 2 = [lOOOP+i]. Also 
[110a;] - 2 = [noon] - 2 = [lOP+2]. By induction we have F([1000P+i]) = F([100011-']) = 
F([1011P]) = F([10P+2], as desired. 

Case 3: x = V for some j > 1. Then [1001x] + l = [lOlO^+i]. Similarly [110x] + l = [110P] + 
1 = [1110^]. By induction we have F([1010-'+i]) = F([101000-'-i]) = F([11100-'-i]) = 
F([1110-']), as desired. 

This completes the proof of correctness of a single transition. 

Ultimately, we are not really interested in computing T{q), but only the image of r(g) 
formed by extracting the third component, which is the one corresponding to F{n). This 
means that we can replace r by r', which is the projection of r along the third component. 
In doing so some of the states of A become equivalent to other states. We can now use the 
standard minimization algorithm for automata to produce the 20-state minimal automaton 
B = {Q', S, A, 6', go, t') computing F{n). Table [2] below gives the names of the states of Q, 
and S' and r' for these states. 

4 Concluding remarks 

It would be interesting to know whether the first difference sequence of the variant of Hof- 
stadter's, i.e., the sequence {V{n + 1) — V{n))n>o, is also 2-automatic. We already know 
that it takes only finitely many values [H Theorem 1, page 5]. Of course it might well be 
the case that this sequence is not automatic: in a very different context, think of the clas- 
sical Thue- Morse sequence which is 2-automatic, but whose runlength sequence is not [T]. 
It would be also interesting to determine for which sequences Qr,s (with the notation in the 
introduction) the frequency sequence is automatic. 
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Table 2: The automaton B 
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1010 
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2 
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1011 


11100111 


3 


11100111 


11100 


1110011 


2 
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